Auditory ERB like admissible wavelet packet features for TIMIT phoneme recognition
نویسندگان
چکیده
منابع مشابه
Phoneme recognition in TIMIT with BLSTM-CTC
We compare the performance of a recurrent neural network with the best results published so far on phoneme recognition in the TIMIT database. These published results have been obtained with a combination of classifiers. However, in this paper we apply a single recurrent neural network to the same task. Our recurrent neural network attains an error rate of 24.6%. This result is not significantly...
متن کاملOptimization of Features Parameters for HMM Phoneme Recognition of TIMIT Corpus
Phoneme is the smallest contrastive unit in the sound system of a language. Moreover, it has a meaningful role in speech recognition. In this study, we are interesting for phonemes recognition of Timit database using HTK toolkit for HMM. The main goal is to determine the optimal parameters for the recognizer. For this reason, different speech analysis techniques were operated such as Mel Freque...
متن کاملSpeech Recognition using Wavelet Packet Features
In view of the growing use of automatic speech recognition in the modern society, we study various alternative representations of the speech signal that have the potential to contribute to the improvement of the recognition performance. Specifically, the main targets of the present article are to overview and evaluate the practical importance of some recently proposed, and thus less studied, wa...
متن کاملSpeaker independent phoneme recognition by MLP using wavelet features
Feature extraction is one of the most important tasks in speech recognition system. Most of the speech recognition systems use Short Time Fourier Transform (STFT) for the derivation of features from the spoken utterances. In this paper we try to exploit the higher time–frequency resolution property of Discrete Wavelet Transform (DWT) for extraction of speaker independent features. The features ...
متن کاملWavelet based feature extraction for phoneme recognition
In an effort to provide a more efficient representation of the acoustical speech signal in the pre-classification stage of a speech recognition system, we consider the application of the Best-Basis Algorithm of Coifman and Wickerhauser. This combines the advantages of using a smooth, compactly-supported wavelet basis with an adaptive time-scale analysis dependent on the problem at hand. We star...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Engineering Science and Technology, an International Journal
سال: 2014
ISSN: 2215-0986
DOI: 10.1016/j.jestch.2014.04.004